Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines)

نویسندگان

  • Emmanuel John M. Carranza
  • Alice G. Laborte
چکیده

Machine learning methods that have been used in data-driven predictive modeling of mineral prospectivity (e.g., artificial neural networks) invariably require large number of training prospect/locations and are unable to handle missing values in certain evidential data. The Random Forests (RF) algorithm, which is a machine learning method, has recently been applied to data-driven predictive mapping of mineral prospectivity, and so it is instructive to further study its efficacy in this particular field. This case study, carried out using data from Abra (Philippines), examines (a) if RF modeling can be used for data-driven modeling of mineral prospectivity in areas with few (i.e., <20) mineral occurrences and (b) if RF modeling can handle evidential data with missing values. We found that RF modeling outperforms weights-of-evidence (WofE) modeling of porphyry-Cu prospectivity in the Abra area, where 12 porphyry-Cu prospects are known to exist. Moreover, just like WofE modeling, RF modeling allows analysis of the spatial associations of known prospects with individual layers of evidential data. Furthermore, RF modeling can handle missing values in evidential data through an RF-based imputation technique whereas in WofE modeling values are simply represented by zero weights. Therefore, the RF algorithm is potentially more useful than existing methods that are currently used for data-driven predictive mapping of mineral prospectivity. In particular, it is not a purely black-box method like artificial neural networks in the context of data-driven predictive modeling of mineral prospectivity. However, further testing of the method in other areas with few mineral occurrences is needed to fully investigate its usefulness in data-driven predictive modeling of mineral prospectivity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Delineation of podiform-type chromite mineralization using geochemical mineralization prospectivity index and staged factor analysis in Balvard area (SE Iran)

The aim of this work was to delineate the prospects of podiform-type chromite by staged factor analysis and geochemical mineralization prospectivity index in Balvard area, SE Iran. The stream sediment data and fault density were used as the exploration features for prospectivity modeling in the studied area. In this regard, two continuous fuzzified evidence layers were generated and integrated ...

متن کامل

Random forests algorithm in podiform chromite prospectivity mapping in Dolatabad area, SE Iran

The Dolatabad area located in SE Iran is a well-endowed terrain owning several chromite mineralized zones. These chromite ore bodies are all hosted in a colored mélange complex zone comprising harzburgite, dunite, and pyroxenite. These deposits are irregular in shape, and are distributed as small lenses along colored mélange zones. The area has a great potential for discovering further chromite...

متن کامل

Women and Vegetable Production in Abra, Philippines: Benefits and Challenges

There is limited literature on how to engage the rural women in agriculture and improve their contributions to household food security and income. This study aimed to contribute to literature on women engagement in agriculture through vegetable production using good agricultural practices. The empirical data used were drawn from technology demonstrations and experimentation, learning fields, an...

متن کامل

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

متن کامل

Predictive Risk Mapping of Leptospirosis for North of Iran Using Pseudo-absences Data

Leptospirosis is a common zoonosis disease with a high prevalence in the world and is recognized as an important public health drawback in both developing and developed countries owing to epidemics and increasing prevalence. Because of the high diversity of hosts that are capable of carrying the causative agent, this disease has an expansive geographical reach. Various environmental and social ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers & Geosciences

دوره 74  شماره 

صفحات  -

تاریخ انتشار 2015